AITopics | concept token

Collaborating Authors

concept token

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Visual Concepts Tokenization

Neural Information Processing SystemsMar-20-2026, 06:13:30 GMT

Obtaining the human-like perception ability of abstracting visual concepts from concrete pixels has always been a fundamental and important target in machine learning research fields such as disentangled representation learning and scene decomposition. Towards this goal, we propose an unsupervised transformer-based Visual Concepts Tokenization framework, dubbed VCT, to perceive an image into a set of disentangled visual concept tokens, with each concept token responding to one type of independent visual concept. Particularly, to obtain these concept tokens, we only use cross-attention to extract visual information from the image tokens layer by layer without self-attention between concept tokens, preventing information leakage across concept tokens. We further propose a Concept Disentangling Loss to facilitate that different concept tokens represent independent visual concepts. The cross-attention and disentangling loss play the role of induction and mutual exclusion for the concept tokens, respectively. Extensive experiments on several popular datasets verify the effectiveness of VCT on the tasks of disentangled representation learning and scene decomposition. VCT achieves the state of the art results by a large margin.

artificial intelligence, concept token, machine learning, (7 more...)

Neural Information Processing Systems

Country: Asia > China > Guangxi Province > Nanning (0.07)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.78)

Add feedback

VisualConceptsTokenization Appendix

Neural Information Processing SystemsFeb-11-2026, 23:12:38 GMT

This is quite similar to what VCT can learn on the synthesized dataset Objects-Room. As the real-world dataset is more diverse, we observe several failure cases shown in Figure 8. We suppose those failure cases are due to VCT, trained withreconstruction loss,isnotgoodatsynthesizing counterfactual samples which arefarfromthe data distribution.

artificial intelligence, machine learning, padding, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

cd062f8003e38f55dcb93df55b2683d6-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 23:12:31 GMT

concept token, representation, visual concept, (14 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangxi Province > Nanning (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Natural Language (0.94)

Add feedback

Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning

Neural Information Processing SystemsFeb-9-2026, 20:28:22 GMT

Can't wait to see the second movie! The first two lines are two demonstrations, and the third line is a test input.

demonstration, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Seattle (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Puerto Rico (0.04)
(11 more...)

Industry:

Health & Medicine (0.93)
Media > Film (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (1.00)
(2 more...)

Add feedback

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement Tao Y ang

Neural Information Processing SystemsNov-19-2025, 22:14:03 GMT

InfoGAN-CR [23]), along with others [37, 28], have been proposed to advance this field further. This work was done during internship at Microsoft Research Asia.

artificial intelligence, machine learning, representation, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangxi Province > Nanning (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Add feedback

A Framework for Quantifying How Pre-Training and Context Benefit In-Context Learning

Song, Bingqing, Li, Jiaxiang, Wang, Rong, Lu, Songtao, Hong, Mingyi

arXiv.org Artificial IntelligenceOct-28-2025

Pre-trained large language models have demonstrated a strong ability to learn from context, known as in-context learning (ICL). Despite a surge of recent applications that leverage such capabilities, it is by no means clear, at least theoretically, how the ICL capabilities arise, and in particular, what is the precise role played by key factors such as pre-training procedure as well as context construction. In this work, we propose a new framework to analyze the ICL performance, for a class of realistic settings, which includes network architectures, data encoding, data generation, and prompt construction process. As a first step, we construct a simple example with a one-layer transformer, and show an interesting result, namely when the pre-train data distribution is different from the query task distribution, a properly constructed context can shift the output distribution towards the query task distribution, in a quantifiable manner, leading to accurate prediction on the query topic. We then extend the findings in the previous step to a more general case, and derive the precise relationship between ICL performance, context length and the KL divergence between pre-train and query task distribution. Finally, we provide experiments to validate our theoretical results.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2510.22594

Country: North America > United States (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
(2 more...)

Add feedback

Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement Tao Y ang

Neural Information Processing SystemsOct-10-2025, 10:21:57 GMT

InfoGAN-CR [23]), along with others [37, 28], have been proposed to advance this field further. This work was done during internship at Microsoft Research Asia.

diffusion model, encdiff, representation, (14 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangxi Province > Nanning (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Add feedback

3255a7554605a88800f4e120b3a929e1-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 10:11:01 GMT

concept token, demonstration, in-context learning, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Washington > King County > Seattle (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Puerto Rico (0.04)
(11 more...)

Industry:

Health & Medicine (0.93)
Media > Film (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Add feedback

Concept-SAE: Active Causal Probing of Visual Model Behavior

Ding, Jianrong, Chen, Muxi, Zhao, Chenchen, Xu, Qiang

arXiv.org Artificial IntelligenceSep-29-2025

Standard Sparse Autoencoders (SAEs) excel at discovering a dictionary of a model's learned features, offering a powerful observational lens. However, the ambiguous and ungrounded nature of these features makes them unreliable instruments for the active, causal probing of model behavior. To solve this, we introduce Concept-SAE, a framework that forges semantically grounded concept tokens through a novel hybrid disentanglement strategy. We first quantitatively demonstrate that our dual-supervision approach produces tokens that are remarkably faithful and spatially localized, outperforming alternative methods in disentanglement. This validated fidelity enables two critical applications: (1) we probe the causal link between internal concepts and predictions via direct intervention, and (2) we probe the model's failure modes by systematically localizing adversarial vulnerabilities to specific layers. Concept-SAE provides a validated blueprint for moving beyond correlational interpretation to the mechanistic, causal probing of model behavior.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2509.22015

Genre: Research Report (1.00)

Technology: